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ABSTRACT 

We forecast the sensitivity with which the Murchison Widcfield Array (MWA) can measure the 21 cm 
power spectrum of cosmic hydrogen, using radiative transfer simulations to model reionization and the 
21 cm signal. The MWA is sensitive to roughly a decade in scale (wavenumbers of k ~ 0.1 — lh Mpc -1 ), 
with foreground contamination precluding measurements on larger scales, and thermal detector noise 
limiting the small scale sensitivity. This amounts primarily to constraints on two numbers: the 
amplitude and slope of the 21 cm power spectrum on the scales probed. We find, however, that the 
redshift evolution in these quantities can yield important information about reionization. We examine 
a range of theoretical models, spanning plausible uncertainties in the nature of the ionizing sources, 
and the abundance of gas-rich mini-halos during reionization. Although the 21 cm power spectrum 
differs substantially among these models, a generic prediction is that the amplitude of the 21 cm power 
spectrum on MWA scales (k ~ 0.4ft, Mpc -1 ) peaks near the epoch when the intergalactic medium 
(IGM) is rj 50% ionized. Moreover, the slope of the 21 cm power spectrum on MWA scales flattens as 
the ionization fraction increases and the sizes of the HII regions grow. Considering detection sensitivity, 
we show that the optimal MWA antenna configuration for power spectrum measurements would pack 
all 500 antenna tiles as close as possible in a compact core. Provided reionization occurs in the MWA 
observing band, this instrument is sensitive enough in its optimal configuration to measure redshift 
evolution in the slope and amplitude of the 21 cm power spectrum. Detecting the characteristic 
redshift evolution of our models will help confirm that observed 21 cm fluctuations originate from 
the IGM, and not from foregrounds, and will provide an indirect constraint on the evolution of the 
volume- filling factor of HII regions during reionization. After two years of observations under favorable 
conditions, the MWA can constrain the filling factor at an epoch when (xi) ~ 0.5 to within roughly 
±5(xi) ~ 0.1 at 2 — a confidence. It can also constrain models for the ionizing sources and the 
abundance of mini-halos during reionization. 

Subject headings: cosmology: theory - intergalactic medium - reionization - large scale structure of 
universe 



1. INTRODUCTION 

Detecting 21 cm emission from the high redshift IGM 
will provide fully three-dimensional information on the 
epoch of reionization (EoR) . Futuristic experiments like 
the Square Kilometer Array (SKA) will have the sen- 
sitivity to produce maps of reionization as a function 
of redshift (Zaldarriaga et al. 2004, McQuinn et al. 
2006). These maps, taken in various frequency bands 
and with sufficiently high angular resolution, will amount 
to a reionization 'movie': they will depict the growth of 
HII regions around individual sources and their subse- 
quent mergers with neighboring HII regions, and detail 
the completion of the reionization epoch, whereby the 
entire volume of the Universe becomes filled with ionized 
hydrogen. 

While producing a reionization movie is perhaps the ul- 
timate goal of 21 cm studies, first generation surveys like 
the MWA 4 will already provide valuable insights into the 
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reionization process. These first generation surveys will 
lack the sensitivity required to produce detailed maps, 
but will allow for a statistical detection. For example, 
the MWA is expected to measure the power spectrum of 
21 cm brightness temperature fluctuations over roughly a 
decade in scale, in each of several redshift bins (McQuinn 
et al. 2006, Bowman et al. 2006). 

In this paper, we use radiative transfer simulations 
(Sokasian et al. 2001, 2003, Zahn et al. 2007, McQuinn 
et al. 2007a, 2007b) to quantify how well power spec- 
trum measurements with the MWA can constrain the 
reionization process. How well can we deduce the gross 
features of the reionization movie from a statistical detec- 
tion? In particular, we aim to check whether the MWA 
can constrain the volume filling factor of HII regions as a 
function of time. Reliable estimates of the volume filling 
factor of HII regions - the 'ionization fraction' of the IGM 
- will pinpoint the timing and duration of reionization. 
This information, marking a key event in the formation 
of structure in our universe, will constrain models for 
the ionizing sources, and solidify existing measurements 
from cosmic microwave background (CMB) polarization 
(Page et al. 2007), quasar spectra (Fan et al. 2006, 
Lidz et al. 2006, 2007a, Becker et al. 2007, Bolton 
& Haehnelt 2007a, 2007b, Mesinger & Haiman 2007, 

of site. 



2 



Wyithe et al. 2005, Wyithe et al. 2007, Gallcrani et al. 
2007), Ly-a emitter (LAE) surveys (Malhotra & Rhoads 
2005, Kashikawa et al. 2006, Furlanetto et al. 2006c, 
Dijkstra et al. 2007, McQuinn et al. 2007b, Mesinger & 
Furlanetto 2007a), and gamma-ray burst (GRB) optical 
afterglow spectra (Totani et al. 2006, McQuinn et al. 
2007c), which currently provide tantalizing, yet contro- 
versial and subtle to interpret, clues. 

The outline of this paper is as follows. In §2 we describe 
our radiative transfer simulations, and examine the sim- 
ulated 21 cm power spectrum and its redshift evolution. 
In §3 we forecast the sensitivity with which the MWA can 
measure the 21 cm power spectrum, paying attention to 
foreground contamination, and examining how the sen- 
sitivity depends on the, as yet unfinalizcd, configuration 
of the MWA's antenna tiles (§3.3). In §4 we show that 
the MWA sensitivity generally boils down to constraints 
on each of the slope and amplitude of the 21 cm power 
spectrum at k <~ 0.1 — lh Mpc -1 . We then quantify how 
well the MWA can constrain these two numbers in dif- 
ferent redshift bins. In §5, we show how constraints on 
the slope and amplitude of the 21 cm power spectrum 
in several redshift bins translate into constraints on the 
redshift evolution of the volume-filling factor of HII re- 
gions during reionization. In §6 we summarize our main 
results and discuss future research directions. 

Throughout we consider a ACDM cosmology param- 
eterized by: n s = 1, cr 8 = 0.8, fl m = 0.27, £1a = 0.73, 
Sib = 0.046, and h = 0.7, (all symbols have their usual 
meanings), consistent with the results from Spergel et al. 
(2007). 

2. THE REDSHIFT EVOLUTION OF THE 21 CM POWER 
SPECTRUM 

In this section, we examine the redshift evolution of 
the 21 cm power spectrum using the radiative transfer 
simulations of McQuinn et al. (2007a, 2007b). We focus 
on the power spectrum since the MWA is expected to 
have limited imaging sensitivity, but can still provide a 
statistical detection of 21 cm fluctuations. First, let us 
briefly define terms. In the limit that the spin tempera- 
ture of the 21 cm transition, T5, is globally much larger 
than the CMB temperature, Tcmb, and ignoring pecu- 
liar velocities, the 21 cm brightness temperature relative 
to the CMB at spatial position x{ is: 

<5r(!ci) = T (x H )[l + <5*(£i)][l + S P {xi)]. (1) 

Here, T is the 21 cm brightness temperature, relative 
to the CMB temperature, at redshift z and frequency 
v = 1420/(1 + z) MHz, for a neutral gas element at the 

cosmic mean gas density; T = 28 [(1 + z)/l{^^ 2 mK in 
our adopted cosmology (Zaldarriaga et al. 2004). The 
quantity (xh ) is the volume-averaged neutral fraction, S x 
is the fractional fluctuation in neutral fraction, while S p 
is the fractional gas density fluctuation. We frequently 
quote results in terms of the volume-averaged ionization 
fraction, (xi) = 1 — (xh), rather than in terms of the 
neutral fraction. 

In general, we measure the power spectrum of 
8t{x~{)/To and plot the dimensionlcss power spectrum of 
this dimensionless quantity, A^fc) = fc 3 P2i(fc)/27r 2 -i.e, 
we plot the variance per logarithmic interval in wavenum- 
ber of the field 5t{x{)/T . We ignore peculiar velocities 
throughout since they have little impact during the bulk 



of the reionization epoch when ionization fluctuations 
are large (McQuinn et al. 2006, Mesinger & Furlanetto 
2007b). We consider only spherically-averaged power 
spectra since the MWA has limited sensitivity in the 
transverse direction (McQuinn et al. 2006). 

2.1. Radiative Transfer Simulations 

Let us briefly describe the simulations used in our anal- 
ysis, and our fiducial model for the ionizing sources and 
reionization history. Radiative transfer is post-processed 
on an evolved N-body simulation using the code of Mc- 
Quinn et al. (2007a), a refinement of the Sokasian et al. 
(2001, 2003) code, which in turn uses the adaptive ray- 
tracing scheme of Abel & Wandelt (2002). Some other 
approaches to large scale reionization simulations are de- 
scribed in Ilicv et al. (2006), Kohler et al. (2005), Trac 
& Cen (2006) and Croft & Altay (2007). The radia- 
tive transfer calculation is performed on top of a 130 
Mpc/h, 1024 3 particle dark matter simulation run with 
an enhanced version of Gadget-2 (Springcl 2005). The 
minimum resolved halo in this simulation is ~ 10 10 M Q , 
but smaller mass halos down to the atomic cooling mass 
(Barkana & Loeb 2001), M coo i ~ 10 8 M Q , are incorpo- 
rated with the appropriate statistical properties as in 
McQuinn et al. (2007a). Ionizing sources are placed in 
simulated halos with simple prescriptions. In our fiducial 
model, we assume that a source's ionizing luminosity is 
proportional to its host halo mass. More specifically, our 
fiducial model is similar to the 'SI' simulation of Mc- 
Quinn et al. (2007a), except that our model here was 
run in a slightly different cosmology, and in a larger (130 
Mpc/h rather than 65.6 Mpc/ft.) simulation box. Addi- 
tionally, the radiative transfer calculations (and our sub- 
sequent power spectrum measurements) are computed on 
a 512 3 grid, rather than a 256 3 grid. 

As we show subsequently, the MWA's ability to detect 
the 21 cm signal from reionization depends strongly on 
the timing and duration of this process (McQuinn et al. 
2006, Bowman et al. 2006). Presently, we are unable to 
predict reliably when reionization occurs, and how long it 
lasts. This depends on many poorly constrained factors 
such as the efficiency and initial mass function of star for- 
mation at high redshift, the dumpiness of the high red- 
shift IGM, the escape fraction of ionizing photons from 
host galaxies, the degree to which photoionizing and su- 
pernova feedback suppress star formation in low mass 
galaxies, and other uncertain physics (see, e.g., the re- 
view by Furlanetto et al. 2006a). In our fiducial model, 
reionization commences around z ~ 11.5, at which point 
~ 2% of the volume of the IGM is ionized, with 15% of 
the volume ionized by z — 8.8, 54% by z — 7.3, and 95% 
by z — 6.8. The reionization epoch may easily occur 
at significantly different redshifts than in this model and 
likely satisfy all existing constraints mentioned in the in- 
troduction, i.e., from CMB polarization, quasar spectra, 
LAE surveys, and the optical afterglow spectra of GRBs. 

However, our predictions for the power spectrum of 
ionization fluctuations at a given ionization fraction are 
more robust than our predictions at a given redshift, 
which suffer from the large uncertainties noted above. In 
particular, McQuinn et al. (2007a) demonstrated that 
the ionization power spectrum varies weakly with red- 
shift when considered in a given model at fixed ionization 
fraction, a consequence of the fact that, in most plausi- 
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Fig. 1. — Redshift evolution of the 21 cm power spectrum in 
our fiducial model. The redshifts and volume-averaged ionization 
fractions shown are ((xi),z) = (0.02, 11.46); (0.15, 8.76); (0.21, 
8.34); (0.54, 7.32); (0.82, 6.90); and (0.96, 6.77). The MWA probes 
wavenumbers between roughly k ~ 0.1 — Ih Mpc -1 . At the lowest 
ionization fraction shown ((aSj) = 0.02), the 21 cm power spectrum 
traces the density power spectrum. There is a brief equilibration 
phase (see text) where the 21 cm power spectrum amplitude drops 
on large scales near (xi) = 0.15. The amplitude on MWA scales 
subsequently increases until (an) ~ 0.5, and then falls off at higher 
ionization fractions. As the ionization fraction grows, after a brief 
steepening during the equilibration phase, the slope of the power 
spectrum on MWA scales flattens. 

ble models, the bias of the ionizing sources is a weak 
function of redshift (Furlanetto et al. 2006b). The 21 
cm power spectrum depends somewhat more strongly on 
redshift at fixed ionization fraction than the ionization 
power spectrum, since the 21 cm field explicitly involves 
the density field, which evolves in time. Nevertheless, 
this dependence is relatively weak during most of the 
EoR when the ionization fluctuations are large (§2.3), 
and one can think of our predictions in a given model as 
roughly invariant with redshift for a given ionization frac- 
tion. Moreover, we aim to extract information regarding 
the filling factor of HII regions from the observations, so 
it will be convenient to consider the signal as a function 
of this quantity. Of course, as we noted earlier, the un- 
certain mapping between ionization fraction and redshift 
can strongly impact the detectability of the signal (§3). 

2.2. Simulated Power Spectrum in our Fiducial Model 

In Figure 1, we show the spherically-averaged 21 cm 
power spectrum from our fiducial model for a range of 
redshifts and ionization fractions. At early times and low 
ionization fraction (z = 11.46, (xj) = 0.02), the 21 cm 
power spectrum simply traces the density power spec- 
trum, except on very small scales where early HII re- 
gions have some impact. At slightly later times, there is 
a brief phase (at (xi) — 0.15, 0.21, z = 8.76,8.34) where 
the large-scale 21 cm power spectrum falls below the den- 



sity power spectrum, and steepens in slope. This occurs 
because the large-scale overdense regions initially contain 
more neutral hydrogen than underdense ones, and con- 
sequently appear brighter in 21 cm. On the other hand, 
the overdense regions ionize first, and quickly transition 
to being dimmer in 21 cm than underdense ones, which 
remain neutral. This transition leads to a brief 'equili- 
bration' phase where overdense and underdense regions 
have similar brightness temperatures, and the large-scale 
21 cm power spectrum is low as a result (Furlanetto et 
al. 2004, Wyithe & Morales 2007). At these early stages 
of reionization, however, our calculations may be inac- 
curate since we neglect the impact of spin temperature 
fluctuations (Pritchard & Furlanetto 2007). 

After the brief equilibration phase, the HII regions 
quickly grow, boosting the large-scale power spectrum 
and suppressing the power on small scales. As we demon- 
strate in the next section, the wavenumbers relevant for 
the MWA are k ~ 0.1 - lh Mpc" 1 . On these scales, the 
amplitude of the 21 cm power spectrum rises rapidly from 
when the filling factor of ionized regions is (a^) = 0.21 
to when (xj) = 0.54, and then drops off at higher ion- 
ized fractions, falling rather quickly when (a;,) > 0.8. In 
conjunction with the increased power, the slope flattens 
and is close to k 3 P2i(k) cx const, for (x^ > 0.6. 

Provided that there are substantial ionization fluctu- 
ations on the scales that the MWA is sensitive to, it is 
unsurprising that the 21 cm power spectrum amplitude 
peaks around the epoch in which the IGM is w 50% 
ionized. Recall that the variance in the ionization field 
averaged on small scales, a 2 — (xf) — (xi) 2 , must re- 
duce to a 2 ~ (xi) — (xi) 2 in the limit that each pixel 
is either completely neutral or completely ionized. In 
this limit, the variance in the ionization field peaks when 
(x^ = 0.5, close to the ionization fraction at which the 
21 cm power spectrum reaches its maximum on MWA 
scales. Of course the 21 cm power spectrum is more 
complicated, since it depends on the cross-correlation be- 
tween ionization and overdensity and higher order con- 
tributions (Lidz et al. 2007b), and since a large portion 
of the ionization variance comes from scales that are not 
probed by the MWA. Nevertheless, our simple argument 
motivates why the maximum in fluctuation amplitude 
occurs near (x^ ~ 0.5. 

In summary, although the MWA may be limited to 
measuring the power spectrum over ~ a decade in scale, 
our model 21 cm power spectra evolve considerably with 
redshift and ionization fraction over this range. Hence, 
sensitivity only to a decade in scale may still be quite 
valuable, provided the MWA can make measurements 
over a number of redshift intervals. 

2.3. Model Dependence 

How robust to model uncertainties is the rise and fall in 
21 cm power spectrum amplitude, and the flattening in 
power spectrum slope, at increasing ionization fraction? 
In order to check this, we examine the amplitude and 
slope of the 21 cm power spectrum for two other cases 
that roughly bracket model uncertainties. McQuinn et 
al. (2007a) investigated the many uncertain physical pa- 
rameters that can impact reionization and found that 
the nature of the ionizing sources and the abundance 
of mini-halos have the strongest influence on the ioniza- 
tion power spectrum. If the sources are rare but very 
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Fig. 2. — Amplitude and slope of model 21 cm power spectra 
as a function of ionization fraction. Bottom: Amplitude of the 21 
cm power spectrum, at the pivot wavenumber (k p = OAh Mpc -1 ) 
for MWA observations in our fiducial (solid red line), rare source 
(blue short-dashed line), and mini-halo models (black long-dashed 
line), plotted as a function of ionization fraction. The green dot- 
dashed line shows, for contrast, the amplitude of the density power 
spectrum obtained by mapping redshift to ionization fraction as in 
our fiducial model. Top: Slope of the 21 cm power spectrum at 
the pivot wavenumber for our three models, as well as the density 
power spectrum slope. The slight tilting of the slope of the density 
power spectrum with decreasing redshift (increasing (xt)) owes to 
quasi-linear effects. The slope and amplitude of the 21 cm power 
spectrum vary considerably among the different models at a given 
ionization fraction. The behavior with ionization fraction across 
the different models is, however, relatively generic: the amplitude 
of the 21 cm power spectrum reaches a maximum close to the 
epoch when ~ 50% of the volume of the IGM is ionized, and the 
slope flattens with increasing ionization fraction. The maximum 
amplitude is reached at slightly lower ionization fraction in our 
massive source model. 

luminous and lie predominantly in rather massive ha- 
los, then the high clustering of the ionizing sources leads 
to larger HII regions at a given ionization fraction, and 
the ionization power spectrum is peaked on larger scales 
than in our fiducial model. On the other hand, if mini- 
halos are abundant, the HII regions are smaller, and the 
ionization power spectrum is peaked on smaller scales 
than in our fiducial model (Furlanetto & Oh 2005, Mc- 
Quinn et al. 2007a). The cross-correlation between ion- 
ization and overdensity is also less significant with rare, 
efficient sources, which acts in conjunction with the in- 
creased ionization power in these models to boost the 21 
cm power spectrum relative to our fiducial model (Lidz 
et al. 2007b). 

As a representative case with rare sources and large 
bubbles, we use a model similar to 'S3' of McQuinn et 
al. (2007a), except run in our present larger volume. 
Briefly, the S3 model has source luminosity L oc M 5 / 3 : 
hence, the most massive, highly clustered halos produce 
most of the ionizing photons. Our representative case 
with abundant mini-halos and small bubbles, adds mini- 



halos of dark matter halo mass M > 10 5 M Q into the sim- 
ulation with the appropriate statistical properties (Mc- 
Quinn et al. 2007a) . Each mini-halo is given a cross sec- 
tion to ionizing photons initially equal to its halo virial 
radius and we subsequently assume that each mini-halo 
is photo-evaporated on a sound-crossing time. This is a 
relatively extreme situation since in reality the mini-halo 
cross sections will shrink with time before they become 
completely photo-evaporated (Shapiro et al. 2004, Ilicv 
et al. 2005), and since many mini-halos may be photo- 
evaporated by pre-heating prior to reionization (Oh & 
Haiman 2003). 

We then determine the amplitude and slope of the 21 
cm power spectrum in each model, and plot these quan- 
tities as a function of ionization fraction. The results of 
this calculation are shown in Figure 2. As a convenient 
choice, we consider the amplitude and slope of the 21 
cm power spectrum at a wavenumber k p = OAh Mpc -1 , 
which is close to the middle of the range of wavenumbers 
probed by the MWA (see §4) . We refer to call this as the 
'pivot wavenumber' since we center our power law fits in 
§4 on this value. 

Concentrating first on the bottom panel of Figure 2, 
it is clear that the 21 cm power spectrum amplitude at 
fixed ionization fraction differs significantly among the 
different cases. The rare source model has the largest 
amplitude fluctuations, the mini-halo model the smallest 
ones, and our fiducial model has an intermediate level 
of fluctuations - precisely the trends anticipated above. 
Note that, despite the presence of ionization fluctuations, 
the amplitude of the 21 cm power spectrum at the pivot 
wavenumber is generally less than that of the density 
power spectrum in our models. This is a consequence of 
the ionization-density cross correlation and higher order 
terms (Lidz et al. 2007b). However, the relative am- 
plitudes of the 21 cm power spectrum and the density 
power spectrum are somewhat sensitive to the choice of 
pivot wavenumber. The density power spectrum falls off 
more rapidly than the 21 cm power spectrum towards 
large scales, and hence the 21 cm power spectrum be- 
comes larger than the density power spectrum for smaller 
wavenumbers (see Figure 1). The basic trend of 21 cm 
amplitude with redshift or ionization fraction, however, 
is similar for most wavenumbers within the MWA band. 

Although the signal differs significantly between mod- 
els, the 21 cm power spectrum amplitude rises and falls 
with increasing ionization fraction in each case. The 
slight dip in power spectrum amplitude near 15% ionized 
fraction owes to the equilibration phase discussed earlier. 
Each model reaches a maximum amplitude rather close 
to an ionization fraction of ~ 50%, although the max- 
imum in the massive source model occurs at a slightly 
lower ionization fraction, ~ 40%. The basic trend is very 
encouraging: the redshift at which the 21 cm power spec- 
trum reaches its maximum may provide an observational 
signature of the redshift at which the IGM is ~ 50% ion- 
ized. The ionization fraction at which the 21 cm power 
spectrum amplitude reaches a maximum depends some- 
what on our choice of pivot wavenumber (see Figure 1), 
although results are similar near the middle of the range 
of wavenumbers probed by the MWA. 5 

5 Wyithe & Morales (2007) find that the variance of the 21 
cm field, smoothed with a cylindrical top-hat of 10' angular scale, 
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Let us now turn our attention to our model predic- 
tions for the slope of the 21 cm power spectrum on MWA 
scales, shown in the top panel of Figure 2. In our fidu- 
cial model, the 21 cm power spectrum is rather fiat with 
k 3 P(k) <~ const, at the end of reionization. This is in 
contrast to the density power spectrum, which has a 
slope of k 3 P(k) ~ fc ' 8 near our pivot wavenumber. A 
similar flattening is seen in the massive source case, but 
here the flattening occurs at smaller ionization fraction 
since this model produces large ionized regions even at 
rather low ionization fractions. The slope of the power 
spectrum even goes slightly negative after > 40% of the 
volume is ionized. On the other hand, in our mini-halo 
model, the ionized regions are smaller at a given ion- 
ization fraction, and there is consequently less flattening 
with increasing ionization fraction. The slight steepening 
seen at early times in our models, near (a;,) ~ 0.15, again 
owes to the equilibration phase discussed in the previous 
section. 

If the MWA finds that the slope of the power spectrum 
is significantly flatter than that of the density power spec- 
trum, this immediately argues for the presence of large 
ionized regions, and implies that reionization is well un- 
derway, while detecting fluctuations at all clearly implies 
that less than 100% of the IGM volume is ionized. 6 In 
the mini-halo model, there is less steepening, and hence 
detecting a relatively steep 21 cm power spectrum slope 
by itself does not imply that one is probing an early phase 
of reionization, as it would for our other cases. It is clear 
from Figure 2, however, that combining measurements 
of the 21 cm power spectrum amplitude and slope in a 
few conveniently placed redshift bins will help move be- 
yond a mere detection of 21 cm fluctuations and allow 
constraints, albeit indirect ones, to be placed on the ion- 
ization fraction as a function of time. 

Moreover, given the observational challenges antici- 
pated for 21 cm observations and the possibility that un- 

reaches a minimum around (a:,) ~ 0.5. This angular scale corre- 
sponds to R ~ 19 Mpc/ft and k ~ 1/R ~ 0.05ft Mpc' 1 at z = 8 
in our cosmology, (although there is some ambiguity in the pro- 
portionality factor relating k and 1/-R). Their cylindrical filter will 
pass fluctuations on still larger scales, and so this is not necessarily 
in contradiction with our results which find that the power spec- 
trum on somewhat smaller scales is maximal near (xi) = 0.5. The 
disadvantage of their approach is that foreground cleaning will pro- 
hibit measuring the large-scale modes passed by their cylindrical 
filter. Their calculation methodology is also accurate on scales only 
much larger then the sizes of individual HII regions, which may im- 
pact their predictions during the middle phase of reionization when 
HII regions arc already quite large. 

6 Recently, Wyithe & Loeb (2007) pointed out that neutral gas in 
damped Ly-a (DLA) absorbers will produce a 21 cm signal even af- 
ter essentially the entire volume of the IGM is reionized. This signal 
will likely become comparable in amplitude to that from the diffuse 
IGM only at the very end of reionization and can potentially be 
distinguished from the diffuse IGM on the basis of its power spec- 
trum shape. Specifically, wc expect the DLA contribution_to the 21 
cm power spectrum to have the form P2i,DLA(fc) ~ %\ m b 2 Pg t s(k), 
where x^ m is the mass-averaged neutral gas fraction locked up in 
DLAs, b is the mass-averaged DLA bias, and Pg $(k) is the matter 
power spectrum. For plausible numbers of ii m = 0.05, and 5 = 3, 
the DLA contribution is a factor of 5 smaller than the diffuse IGM 
contribution in our fiducial model at (xi) = 0.96 and k p ~ 0.4ft 
Mpc -1 . The diffuse component is even more dominant at large 
scales, and less dominant on smaller scales. Hence, we expect the 
DLA contribution to kick in essentially after reionization. The 
MWA should therefore be able to study each of these interesting 
signals separately. 



desirable residuals survive the foreground cleaning pro- 
cess, simple theoretical diagnostics such as the ones men- 
tioned here are extremely valuable. Observing the power 
spectrum amplitude rise and fall with redshift, and the 
power spectrum slope flatten, is characteristic of the an- 
ticipated reionization signal, but foreground residuals are 
unlikely to mimic this behavior. 

3. MWA POWER SPECTRUM SENSITIVITY 

We now consider the statistical significance at which 
the MWA might detect this characteristic redshift evolu- 
tion in the 21 cm power spectrum. We first write down 
the equations describing statistical error estimates for the 
21 cm power spectrum (Zaldarriaga et al. 2004, Morales 
2005, McQuinn et al. 2006). We generally follow the 
notation of Furlanetto & Lidz (2007). 

The variance of a 21 cm power spectrum estimate for 
a single fc-mode with line of sight component fcii = (ik, 
restricting ourselves to modes in the upper-half plane, 
considering both sample variance and thermal detector 
noise, and assuming Gaussian statistics, is given by: 
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The first term in Equation (2) is the contribution from 
sample variance, while the second term comes from ther- 
mal noise in the radio telescope. The thermal noise term 
depends on the system temperature, T sys , the co-moving 
distance to the center of the survey at redshift z, D(z), 
the survey depth, Al?, the observed wavelength, A, the 
effective area of each antenna tile, A e , the survey band- 
width, B, the total observing time, ii nt , and the distri- 
bution of antennae. The dependence on antenna config- 
uration is encoded in n(k±) which denotes the number 
density of baselines observing a mode with transverse 
wavenumber k± (McQuinn et al. 2006). The factor of 
Tq in the noise term is appropriate because we consider 
the error in the power spectrum of 5t/Tq (Equation 1). 

In order to estimate the variance of the power spectrum 
averaged over a spherical shell of logarithmic width e = 
dink, we add the statistical error for individual fc-mode 
estimates in inverse quadrature: 



1 



= E 



Ek 3 V sl 



1 



4tt 2 



(3) 



In this equation, V^ urve y = D 2 AD (A 2 /^4 e ) denotes the 
effective survey volume of our radio telescope, and the 
sum is over all modes in the upper half plane contained 
within the survey volume. The maximum fi included in 
the sum over modes is set by the survey depth, up to 
a maximum possible of fi = 1, while the minimum /i 
depends on the highest transverse wavenumber sampled 
by the array, down to a minimum possible of \i — 0. In 
practice, we approximate this sum by an integral. We 
generally plot the error in A 2 , 1 (fc) which is related to the 
above ap(k) by a factor of k 3 /2tt 2 . 

Making the array more compact increases the number 
density of baselines, n(fcj_), sampling low kj_, but trun- 
cates the sum over modes (Equation 3) at larger /i m ; n . 
An important point to note is that, owing to high-fc line 
of sight modes in the sum of Equation (3), the MWA 
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can still estimate the power spectrum for wavenumbers 
with fc > fc_L, roax > where fe_L, max is the highest transverse 
wavenumber probed by the antenna array. Indeed, for 
the rather compact array configurations expected for the 
MWA, the sensitivity is concentrated along the line of 
sight, and the sum over /z in Equation (3) is restricted to 
a limited range close to fi = 1 for high fc modes. 

3.1. Assumed MWA Survey Parameters 

The MWA will be an array of N a = 500 antenna tiles 
observing a wide field on the sky of ~ 800 deg 2 at fre- 
quencies of 80 — 300 MHz, corresponding to 21 cm emis- 
sion redshifts of z = 4 — 17. Each antenna tile consists 
of 16 dual polarization dipoles layed out in a 4 m-by-4 m 
grid, and covers an effective collecting area of A e = 14 
m 2 at z = 8 (Bowman et al. 2006). We linearly in- 
terpolate between the values of effective area given in 
Table 2 of Bowman et al. (2006) to determine the effec- 
tive antenna areas at other redshifts. We assume that 
the system temperature is set by the sky temperature, 
which we take to be T sys = 280((1 + z)/7.5) 2 3 K, fol- 
lowing Wyithe & Morales (2007). We consider obser- 
vations over a bandwidth of B = 6 MHz, and a total 
observing time of t- m t = lj 000 hrs. This integration time 
represents an optimistic estimate for the total amount 
of usable observing time the MWA might achieve within 
a single calendar year. The bandwidth is chosen to be 
small enough to ensure that the signal evolves minimally 
over the corresponding redshift interval (McQuinn et al. 
2006), which is only Az = 0.3 at z = 8. 

We will consider two choices for the antenna distribu- 
tion, since the sensitivity of the array depends strongly 
on the configuration of the tiles (McQuinn et al. 2006, 
Bowman et al. 2006, §3.3). In the first configuration, 
following Bowman et al. (2006), we assume that a frac- 
tion of the 500 antenna tiles are packed as close as physi- 
cally possible within a 20 m core, and that the remaining 
antennae follow an r~ 2 distribution out to a maximum 
baseline of 1.5 km. This configuration has ~ 80 anten- 
nae in the compact 20 m core. In addition, we consider a 
configuration where all 500 antennae are packed as close 
as physically possible within a ~ 50 m core, which we 
will term the 'super-core' distribution. For each config- 
uration, we calculate the approximate number density 
of baselines observing a given transverse mode, n(fcj_), 
by determining the auto-correlation function of the an- 
tenna distribution. In practice, one may not be able to 
place the antennae as close as physically possible, since, 
e.g., the tiles will interfere with one another when packed 
too tightly. Nevertheless, our examples represent useful 
models to understand how sensitivity depends on array 
configuration. 

3.2. Results 

We estimate the statistical error bars from our fiducial 
model 21 cm power spectrum calculations (Figure 1), 
using the MWA survey specifications, and Equations (2) 
and (3), and considering spherical shells of logarithmic 
width e = 0.5. Note that our hypothetical surveys extend 
to much higher wavenumbers along than transverse to 
the line of sight, and so our surveys do not sample full 
spherical shells in fc-space at high wavenumber - they 
only sample a limited fraction of a fc-space shell close to 
the line of sight axis. 
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Fig. 3. — 21 cm power spectrum sensitivity for the MWA at 
different redshifts for our fiducial model. The red curves show the 
theoretical model 21 cm power spectra from our fiducial model 
(from Figure 1) at different redshifts and ionization fractions, as 
labeled. The red error bars indicate statistical error estimates for 
the MWA with 1,000 hours of observation, and an r~ 2 antenna 
distribution. The smaller blue error bars show the expected sta- 
tistical error bars for the MWA with antennae arranged in the 
super-core configuration (see text). Foreground cleaning over our 
assumed 6 MHz bandwidth prohibits power spectrum measure- 
ments on wavenumbers smaller than indicated by the black dashed 
lines. 



In this section, our calculations are similar to previ- 
ous work (McQuinn et al. 2006, Bowman et al. 2006), 
except here we consider a more realistic model for the 
21 cm power spectrum. The results of these calculations 
are shown in Figure 3 for four example redshifts and 
ionized fractions. The dashed lines in the figure indi- 
cate wavenumbers below which foreground removal - for 
our particular choice of bandwidth, B = 6 MHz - will 
prohibit power spectrum measurements. The foreground 
cleaning process exploits the expectation that the fore- 
grounds are spectrally smooth, while the signal has struc- 
ture in frequency space (e.g. Zaldarriaga et al. 2004, 
Morales & Hewitt 2004, Morales et al. 2006, McQuinn 
et al. 2006). This process will - at the very minimum - 
remove all line of sight modes with k\\ < 2ir/AD. The 
discreteness of modes in the survey then implies that all 
modes with fc < 2ir/AD will be lost in the foreground 
cleaning process. At high wavenumber, the antenna ar- 
ray samples modes poorly owing to its limited angular 
resolution, and hence power measurements at wavenum- 
ber above fc > lh Mpc -1 are quire noisy. 

Measurements early in the EoR generally have low sig- 
nal to noise. This is illustrated in the top two panels 
of Figure 3. The sensitivity of the measurements de- 
grades significantly towards high redshift because of the 
increasing sky brightness towards low frequency, T sys oc 
(1 + z) 2 ' 3 , and because the signal is weaker on large scales 
at early times. In this particular model, the z = 8.76, 



(xi) = 0.15 measurement actually has less signal to noise 
than the higher redshift z = 11.46, (x/) — 0.02 model. 
This is because the z — 8.76 output is near the equilibra- 
tion phase (see §2.2), where the large scale power actu- 
ally drops below the density power spectrum. At lower 
redshifts, during the intermediate phase of reionization, 
the MWA will make highly significant power spectrum 
measurements over a range in scales, as illustrated by 
the lower left panel where z = 7.32, (xj) = 0.54. Indeed, 
even when the IGM is 96% ionized at z = 6.77 in this 
model, 21 cm fluctuations should be detectable with the 
MWA in its r~ 2 arrangement, with a strong detection 
expected for the super-core configuration. 

To provide a quantitative measure, we calculate the 
total S/N at which the MWA can measure the 21 cm 
power spectrum. We compute the total S/N by taking 
the square root of the sum of the squares of the S/N 
in each fc-bin. For our r~ 2 antenna configuration, this 
calculation gives ~ 1 — a, 0.5 — er, 24 — a, and 13 — a 
at ({Xi),z) = (0.02, 11.46); (0.15, 8.76); (0.54, 7.32) and 
(0.96, 6.77) respectively. In the super-core configuration, 
these numbers are boosted to ~ 4 — a, 2 — a, 39 — a and 
25 — o respectively. Taken at face value, these numbers 
imply that the MWA, in its super-core configuration, can 
achieve a 3 — a power spectrum detection when the IGM 
is 50% neutral after only tj nt ~ 100 hrs of integration 
time. Note, however, that most of the detection sensi- 
tivity comes from the first fc-bin beyond the foreground 
limit. Given that this first bin is potentially most im- 
pacted by residual foregrounds (McQuinn et al. 2006), 
and given that the discriminating power between models 
is slightly larger at higher fc, we also quote the S/N at 
which one can measure power in the fc-bin with fc = 0.46ft. 
Mpc -1 (and e = eflnk = 0.5), near the middle of the 
range of scales probed by the MWA. In the super-core 
configuration, the S/N for detecting power in this fc-bin 
is 0.6 — <7, 0.8 — a, 8 — <r, and 2 — a for the respective 
((xi), z) above (see also §4). 

To get a sense for how the power spectrum signal to 
noise depends on when reionization occurs, we repeat our 
sensitivity calculations for models in which (x^ — 0.54 
at each of z = 6.77, 9.76 and z — 11.46. We con- 
trast these models with our earlier calculations in which 
(x^ — 0.96, 0.07, 0.02 at these respective redshifts. In or- 
der to avoid running additional radiative transfer calcu- 
lations, we construct 21 cm fields at these redshifts using 
in each case the ionization field in our fiducial model 
at z = 7.32 (which has {x/} — 0.54), in conjunction 
with the simulated density field at each desired redshift. 
This should be a good approximation since the ionization 
power in a given model depends mainly on the ionization 
fraction and not explicitly on redshift (McQuinn et al. 
2007a). 

The results of this calculation are shown in Figure 4. 
Comparing with the results of Figure 3, we see that the 
21 cm power spectrum sensitivity at a given redshift is 
enhanced significantly when {x/) ~ 0.5 compared to the 
sensitivity at the beginning or end of reionization. The 
large ionized bubbles during the middle phase of reion- 
ization significantly boost the amplitude of the 21 cm 
power spectrum on large scales, and facilitate detection 
above the thermal detector noise. Quantitatively, the to- 
tal S/N at which the MWA can detect the 21 cm power 
spectrum at z = 11.46 in our fiducial model (in which 
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Fig. 4. — 21 cm power spectrum sensitivity for the MWA at dif- 
ferent redshifts with (xi) ~ 0.5 at each redshift. Similar to Figure 
3 except here we show how the sensitivity depends on redshift for 
a fixed ionization fraction of (xj) ~ 0.5. 



case (xi) = 0.02) is only 1 — a for the r~ 2 antenna con- 
figuration, and 4 — a for the super-core distribution. If 
(x/) ~ 0.5 at this redshift, however, the S/N increases to 
5 — a in the r™ 2 configuration, and 14 — a for the super- 
core arrangement. For comparison, the S/N at which the 
MWA can detect power in the fc = 0.46ft. Mpc -1 bin is 
1.5- cr, 3.4-cr and 9.9-er when (xi) = 0.54 at z = 11.46, 
9.76, and 6.77 respectively. Interestingly, despite the in- 
creased sky noise towards high redshift, the MWA can 
still expect a detection at z ~ 11 if (x/) ~ 0.5 at this 
redshift. 

It is also interesting to examine how strong a lower 
limit on the ionization fraction the MWA might place 
from a null detection. If the IGM is mostly ionized 
at z = 8, for example, should we expect the MWA 
to detect 21 cm power at this redshift? To investi- 
gate this, we use the ionization fields from our fiducial 
model at (x^ — 0.82, 0.96 to produce mostly ionized 21 
cm power spectrum models at z = 8.15. Interestingly, 
with our fiducial survey parameters these models both 
yield S> 3 — a power spectrum detections in each of the 
r~ 2 and super-core configurations, provided we include 
all modes with wavenumber larger than our foreground 
cut. For comparison, the S/N for detecting power in 
the fc = 0.46ft. Mpc -1 bin alone is larger than 3 — a only 
for the (x^ — 0.82 model in the super-core configuration. 
The super-core configuration produces only a 1 — a power 
spectrum detection in this bin at (x-i) = 0.96, z — 8.15, 
while neither {x/} model produces a significant detection 
in this bin with the r~ 2 antenna configuration. Hence if 
the MWA fails to detect power at z < 8, this will still 
provide a stringent lower limit on (xj). Since much of 
the power spectrum sensitivity comes from fc-bins close 
to the foreground limit, however, the precise lower limit 
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Fig. 5. — Density of baselines observing a given mode (or vis- 
ibility it) for our two model antenna configurations at Z = 7.32. 
The density of baselines drops off rapidly towards large visibili- 
ties for both arrangements. The super-core configuration provides 
superior sampling of the small visibilities. 

depends on how well foreground cleaning algorithms per- 
form for wavenumbers close to the foreground cut (Mc- 
Quinn et al. 2006). This is an important topic for further 
study. 

3.3. Sensitivity to Array Configuration 

As noted earlier (Zaldarriaga 2004, McQuinn et al. 
2006, Bowman et al. 2006), the sensitivity of 21 cm 
power spectrum measurements depends strongly on the 
array's antenna tile configuration. Here we expand on 
this point, advocating a still more compact antenna dis- 
tribution than previous authors. 

Concentrating on the fiducial model output at z = 7.32 
where the S/N of the detection is highest, it is clear that 
the super-core configuration yields significantly smaller 
statistical error bars. At k = O.lh Mpc -1 the statistical 
error bars are a factor of 1.3 smaller in the super-core con- 
figuration than for the r -2 distribution, but by k = lh 
Mpc -1 , the error bars are a factor of ~ 4 smaller in the 
super-core configuration. The gain in sensitivity at low k 
is less substantial because the large scale modes become 
sample variance dominated, and hence making the array 
more compact beyond some point does not further boost 
power spectrum sensitivity. Interestingly, the fact that 
large scale modes become sample variance dominated in 
the super-core configuration, implies that the MWA is ca- 
pable of imaging large scales modes in this arrangement 
- i.e., the signal to noise per mode is unity for some large 
scale modes. We will consider the MWA's imaging ca- 
pabilities in future work. Here we focus on the array's 
sensitivity for statistical power spectrum measurements. 

The increased sensitivity of the array in its compact 
core configuration arises because the long baselines in the 
r~ 2 distribution are too sparsely sampled to be useful. 



To illustrate this, we plot the number density of baselines 
observing a given visibility (see Equation 2) for each an- 
tenna configuration (see also Bowman et al. 2006). The 
visibility |u| is related to the transverse wavenumber by 
2it\u\ = k±D(z). Integrating n(|u|) over all u gives the 
total number of antenna pairs in the array. 

The r~ 2 antenna distribution is relatively flat at low 
u owing to this configuration's ~ 20m core region where 
the antennae are stacked as close as physically possible, 
while it falls off steeply towards high u. Note that the 
thermal noise contribution to the power spectrum vari- 
ance scales as ap(k,ff) oc l/n(k±) (Equation 2), and so 
the rapidly diminishing number of antenna pairs towards 
high u implies a correspondingly rapid increase in the 
thermal noise. Indeed, by u = 85 the number density of 
baselines falls by an order of magnitude from its value at 
the smallest visibilities sampled, and the thermal noise in 
these high fcj_ modes becomes prohibitive. In our super- 
core configuration, the antennae are packed as close as 
possible, doing away with the long baselines entirely, yet 
significantly increasing the sensitivity to low k± modes. 

Note that the limited resolution of our array config- 
urations in the transverse direction does not prohibit 
measuring high-A: modes entirely, owing to the array's 
superior line of sight resolution (Morales 2005, Bowman 
et al. 2006, McQuinn et al. 2006). Instead, our sum 
over modes in Equation (3) is simply restricted to modes 
close to the line of sight, i.e. to modes with fi larger 
than /u m i n = [1 - k 2 ^ max /k 2 ] 1 ^ 2 . While this prohibits 
probing the full angular structure of the 21 cm power 
spectrum, this is not a big limitation during most of the 
EoR when peculiar velocities have little impact and the 
power spectrum is essentially isotropic. Our calculations 
hence argue that the optimal antenna configuration for 
power spectrum measurements places all 500 antennae 
as close as physically possible in a compact core. 

3.4. Array Configuration Trade-offs 

There are of course some tradeoffs involved in making 
the array more compact. For one, in order to calibrate 
the array, high angular resolution is needed to detect 
bright point sources above the confusion noise from un- 
resolved sources. Let us make a rough estimate for how 
stringent these requirements are. 

We assume that in order to calibrate the array one 
needs to detect iVbright sources above confusion and ther- 
mal detector noise after integrating for a duration of 
tint hours. In practice, the MWA likely requires of or- 
der Albright ~ a few hundred bright sources for antenna 
calibration, and needs to be able to detect these bright 
sources on time scales of tint ~ tens of seconds. The num- 
ber of bright sources required for antenna calibration de- 
pends on the total number of antennae, and the number 
of calibration parameters per antenna. The calibration 
timescale is set by the timescale over which ionospheric 
distortions vary. We leave these quantities as free param- 
eters in our calculation to show how calibration require- 
ments scale with the number of bright sources required, 
and the calibration timescale. 

For these calculations, we adopt the Di Matteo et al. 
(2002) model for radio source counts. This model is 
based on radio counts from the 6C catalog at 151 MHz 
(Hales et al. 1988), with an extrapolation to low source 
flux. The differential source count, dn/dS, is given by 
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Fig. 6. — Requirements for detecting bright point sources and 
calibrating the array. Top: The angular resolution, expressed as 
a baseline length, needed to detect Alight sources above confu- 
sion noise from unresolved sources at 10 — a significance. Bottom: 
The red solid line is the level of confusion noise as a function of 
the number of bright sources needed for calibration purposes. The 
resolution, A8, is adjusted (as in the top panel) to allow for a 
10 — <t detection of each point source. The blue dashed lines show 
the thermal detector noise for 7V a j ong = 10, 100, and 500 anten- 
nae at long baselines (from top to bottom), plotted as a function 
of observing time (top horizontal axis). Approximately 50 — 100 
antennae at long (~ lkm) baselines suffice to detect a few hun- 
dred point sources above the thermal and confusion noise, after 
integrations of t- lnt ~ tens of seconds. 



dn/dS = 4(5*/S'o)- 1 - 75 sources mJy- 1 star 1 , for S < S Q 



and dn/dS = 4(S*/5 ) 



-2.51 



sources mJy str 



for 

S > S , with So = 880 mJy. 7 The number of bright 
sources detectable by the MWA above some limiting min- 
imum flux, S m i B , across the entire MWA field of view, 
An, is: N(> S min ) = An J^, n dSdn/dS. The Poisson 
noise from unresolved sources below the detection limit 
is 8 : 



(S 2 



dSS 2 dn/dS 



(4) 



The resulting confusion noise, given an MWA pixel of 

1/2 

angular size A6, is ~ [(S 2 )(A6) 2 ] ' . In order to detect 
a point source of flux S'mm above the confusion noise 
from unresolved sources, we would like the flux from this 
source to be say 10 times as large as the confusion noise, 

SW [<5 2 )(A6») 2 ] 1/2 = 10. This requirement demands 

7 The notation in Di Matteo et al. (2002) is ambiguous, and their 
model for the differential source counts has been misinterpreted by 
Gnedin & Shaver (2004). The differential source count given above 
matches the bright end measurements of Hales et al. (1988), and 
is continuous across Sq where the slope flattens. 

8 Using the model of Di Matteo et al. (2002), we find that 
the variance owing to the clustering of unresolved radio sources is 
much smaller than the Poisson noise for the bright source cuts and 
angular scales of interest. We hence neglect clustering here. 



high angular resolution: one needs relatively good angu- 
lar resolution in order to beat down the confusion noise 
from the unresolved sources. 

We investigate the angular resolution required to over- 
come confusion noise from unresolved point sources in 
Figure 6. First, we determine the minimum flux, 5*1111111 
above which the MWA can detect iVb r i g ht point sources 
in its entire field of view. We then solve for the angular 
resolution A9 such that the confusion noise from point 
sources with S < S m i n is less than O.lS'min - i.e., we re- 
quire that the source is detectable above the confusion 
noise at 10 — a significance. The angular resolution satis- 
fying this constraint, AO, corresponds to a baseline with 
length L max ~ X bs/A9. The figure illustrates that, in 
order to detect a few hundred point sources above the 
confusion noise - roughly the number of bright point 
sources required for MWA array calibration - one re- 
quires baselines of order ~ 1 km. Hence at least some 
antennae are required beyond the super-core configura- 
tion, which provides a maximum baseline of only ~ 100 
m. 

In order to determine how many antennae are needed 
at long baselines, we want to demand that the thermal 
detector noise of the large baseline antennae is much less 
than the source flux, S m i n , that we aim to detect. Let 
us take -/V a .i ong antennae at long baselines. The detector 
noise for point source detection, with N^\ ong antennae, 
each of effective area A e , integrating for tj nt hours over 
a bandwidth B is: 



5thcrm 



2k B T s , 



(5) 



We plot the thermal noise in the bottom panel of Figure 
6 for a bandwidth of B — 6 MHz, and A e — 14 m 2 (ap- 
propriate for the MWA at z = 8; Bowman et al. (2006)), 
as a function of ij nt for several different values of N at i ons . 
On timescales of ~ tens of seconds, the thermal noise ex- 
pected for ~ 100 antennae at long baselines is much less 
than both the confusion noise and the minimum source 
flux required to detect a few hundred bright sources. This 
suggests that while some antennae are required at large 
baselines, the requirements are rather modest. In partic- 
ular, only 50— 100 long baseline antennae are required by 
our simple estimate, leaving 400 — 450 antennae for our 
super-core configuration. This is conservative, since we 
have effectively treated the antennas in the core as a sin- 
gle antenna in considering the calibration requirements. 
Our proposed arrangement is in marked contrast to the 
suggested r~ 2 distribution, in which only ~ 80 antennae 
are in a compact core. 

From our simple estimates, it appears that the super- 
core, or a similar configuration, is feasible since only a 
handful of long baselines appear necessary for array cal- 
ibration. This should, of course, be tested with detailed 
simulations of the MWA pipeline, performed for a va- 
riety of antenna distributions. While we have argued 
that the super-core is optimal for spherically-averaged 
21 cm power spectrum estimates, it may not be optimal 
for other programs. Hcliosphcric science, and surveys 
for radio transients for example, will naturally require 
high angular resolution and favor less compact antenna 
configurations. Other EoR science projects may be im- 
pacted as well. For example, a compact-core is not ideal 
for detecting the 21 cm-galaxy cross power spectrum: 
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here one needs to balance the MWA's high sensitivity 
along the line of sight, but poor transverse sensitivity, 
with the galaxy survey's poor line of sight sensitivity - 
owing to uncertainties in photometric redshifts - yet su- 
perior transverse sensitivity (Furlanetto & Lidz 2007). 
The MWA program to image quasar HII regions (Geil & 
Wyithe 2007) may also suffer from reduced angular res- 
olution. In practice, the MWA might start with an r~ 2 
configuration, or a slightly more compact arrangement, 
and gradually add antennae into a compact core. 



4. CONSTRAINING THE POWER SPECTRUM AMPLITUDE 
AND SLOPE 

Figure 3 suggests that the MWA will mainly be sensi- 
tive to the amplitude and the slope of the 21 cm power 
spectrum at each of several redshifts. Constraints on just 
these two numbers, in several redshift bins, are still quite 
interesting: we showed in §2.3 that the 21 cm power spec- 
trum amplitude and slope evolve with redshift in a rela- 
tively generic and informative manner. Let us consider 
how well we can determine the parameters of a power law 
fit to the MWA power spectrum measurements in several 
redshift bins. 

In practice, the MWA will be limited by large data 
rates to handling 32 MHz intervals of bandwidth at a 
time. This 32 MHz interval can be sampled in any man- 
ner desired from the full spectral range covered by the 
instrument, 80-300 MHz - e.g., one could take 20 MHz 
centered around z ~ 6 and 12 MHz around z ~ 10, or one 
could take a contiguous 32 MHz stretch, etc. One can 
further sub-divide this 32 MHz into smaller intervals for 
foreground-cleaning and power spectrum analyses. The 
best choice of frequencies to analyze depends, of course, 
on when reionization occurs. 

For illustrative purposes, let us start with a 32 con- 
tiguous MHz bin centered fortuitously onz = 7.4, when 
reionization is roughly 50% complete in our model. We 
discuss less fortunate choices subsequently. We divide 
this up into five 6 MHz redshift bins in which we sepa- 
rately estimate power spectrum errors, and discard the 
remaining 2 MHz in our analysis. We consider observing 
this 32 MHz stretch for a year, assuming, as above, that 
this yields 1, 000 hours of usable data. This stretch of 32 
MHz corresponds to a redshift range of z = 6.7 — 8.2. 

In addition, we consider observations of a further con- 
tiguous stretch of 32 MHz, expanding our redshift cover- 
age out to z = 10.4. Again, we assume that this higher 
redshift stretch is observed for t- m t = 1,000 hours. The 
high redshift stretch might be observed roughly simul- 
taneously with the lower redshift band - e.g., one day 
observing at low redshift followed by a day at high red- 
shift - or one might observe consecutively for a year at 
low redshift, followed by a year at high redshift, etc. Ei- 
ther way, in total our estimates amount, optimistically, 
to two years of observing. 

We then estimate error bars for the observing strategy 
described above, using the formulas in §3 for each of the 
redshift bins considered here. Further, we fit a polyno- 
mial function to lnA| 1 (k) in ln(k), truncating the fit at 
the lowest order that yields an unbiased estimate of the 
amplitude and slope of the power spectrum at the pivot 
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Fig. 7. — Sensitivity of the MWA to redshift evolution in the 
power spectrum amplitude and slope. For illustration, in each 
panel we consider the sensitivity of the MWA, observing for one 
year (yielding 1, 000 usable hours) over a contiguous stretch of 30 
MHz, centered around a redshift of z = 7.4 (square points), and 
broken into five 6 MHz bins. We additionally consider a separate 
year of observations devoted to an adjacent contiguous 30 MHz 
stretch, again in 6 MHz bins, but focused on higher redshifts (tri- 
angles). Bottom: Redshift evolution in the power spectrum am- 
plitude at the MWA pivot wavenumber. The red points show the 
expected mean power spectrum amplitude and error bars in our 
fiducial model for the MWA with an r~ 2 antenna distribution, 
while the blue points show forecasts for the MWA in the super- 
core configuration. Top: Identical to the bottom panel, except for 
the slope of the power spectrum. The slope and amplitude errors 
are correlated (see text). The MWA has the sensitivity, in the 
super-core configuration, to detect the power spectrum amplitude 
rise and fall in this model, and the slope flatten, with increasing 
redshift. 

wavenumber, A| 1 (fc = k p ): 

N P 

mAli (k) = \nA 2 21 (k = k p ) + £ aj [ln(k/k p )] j . (6) 

j=i 

Since the 21 cm power spectrum is not a perfect power 
law on MWA scales (Figure 1), it is important that we 
leave our fit reasonably general. We would like to en- 
sure that our estimates of the amplitude and slope of 
the 21 cm power spectrum are unbiased: do the best fit 
amplitude and slope change by more than the error bars 
as we include more parameters in our fit? Further, we 
want to check whether the MWA is able to detect ad- 
ditional parameters in the generalized polynomial fit - 
e.g., a 'running' in the power law slope. We estimate the 
parameters A 2 (fc = k p ), (Xj, and their errors assuming 
Gaussian statistics, and using standard maximum likeli- 
hood estimates. We fit to the simulation data given our 
power spectrum error estimates. We use all scales smaller 
than prohibited by foreground contamination, and re- 
strict ourselves to k < lh Mpc -1 , since smaller scales 
are too noisy to yield useful information. 

We adopt k p = OAh Mpc -1 as the pivot wavenumber 
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for our fit (Equation 6). This wavenumber is close to 
the middle of the range of scales probed, and represents 
a convenient choice. Note, however, that the sensitivity 
is a strong function of wavenumber and the power spec- 
trum is constrained more tightly by the data on large 
scales than on small scales (Figure 3). This implies that 
estimates of the power spectrum amplitude and slope are 
correlated for our choice of k p , in contrast to the tradi- 
tional choice of pivot scale in which amplitude and slope 
errors are uncorrelated. Provided we account for corre- 
lations in the amplitude and slope estimates, our final 
constraints on the ionization fraction (§5) are, however, 
independent of k p . The wave number at which the slope 
and amplitude errors are uncorrelated will depend on the 
foreground cut wavenumber, and on redshift. We hence 
prefer to plot results in the middle of the range of scales 
probed by the MWA, where the amplitude is a partic- 
ularly strong function of ionization fraction, keeping in 
mind the error correlations. Moreover, note that the to- 
tal S/N for 21 cm power spectrum detection is larger 
than the significance at which the MWA can estimate 
the power spectrum amplitude at this particular scale 
(see §3.2), owing to the strong scale dependence of ther- 
mal noise. 

The results of our calculation are shown in Figure 7, for 
each of an r~ 2 antenna distribution, and the super-core 
configuration. In each redshift bin, as we detail shortly, 
we choose the minimum number of parameters in our fit 
(Equation 6) that yields an unbiased estimate of the 21 
cm power spectrum amplitude and slope. Considering 
first the r~ 2 antenna distribution, the bottom panel of 
the figure shows that the MWA, in this configuration, 
can only weakly detect evolution in the power spectrum 
amplitude at k p = QAh Mpc -1 in our fiducial model. 
Still more marginal is the ability of the MWA, in the r~ 2 
arrangement, to detect redshift evolution in the slope 
of the 21 cm power spectrum (top panel): one can al- 
most draw a straight line through all of the error bars in 
this figure. The MWA, in the super-core configuration, 
however, enables significant detections of redshift evolu- 
tion in both the slope and amplitude of the 21 cm power 
spectrum. As remarked previously, the slope and ampli- 
tude errors are significantly correlated. For example, the 
correlation coefficient between our amplitude and slope 
estimates is r = 0.7 at z = 7.4 in the super-core con- 
figuration, and r = 0.9 at the same redshift for the r~ 2 
antenna configuration. 

Before more closely considering the insights that can 
be gleaned from detecting this redshift evolution, let us 
briefly return to the question of the number of parame- 
ters required. We fit our model power spectra with plau- 
sible MWA error-bars in the super-core configuration, 
using a polynomial fit with each of N p = 1,2, and 3. 
Similar considerations apply for the r~ 2 antenna distri- 
bution. The results of these fits are shown in Figure 8: 
the red squares show fits with N p = 1 included, the blue 
squares show fits with N p — 2, and the green squares 
show fits with N p — 3. The top panel shows constraints 
on power spectrum amplitude, the middle panel shows 
constraints on the slope, and the bottom panel shows 
constraints on the 'running' of the slope, cti. 

We judge our estimate of a parameter 'unbiased' if the 
preferred value of that parameter changes by less than 
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Fig. 8. — Number of parameters constrained by MWA power 
spectrum measurements in the super-core configuration. We as- 
sume that lnA| 1 (k) is a polynomial in ln(k) and examine how 
many terms in the polynomial expansion the MWA can constrain. 
Top: Constraints on the amplitude of the 21 cm power spectrum 
at k = k p . The red points show constraints assuming that A|j(fc) 
is a power law in k over MWA-detected scales. The blue points 
include a quadratic fitting term in ln(k), while the green points 
include a cubic term in ln(k). Middle: Similar for the slope of the 
power spectrum at k = k v . Bottom: Constraints on the running 
of the slope at k = k p . At high redshift, fitting a mere power 
law results in unbiased estimates of the amplitude and slope. At 
moderate redshift, where the sensitivity is superior, yet there is 
still enough neutral hydrogen around to produce a significant 21 
cm signal (l + z = 8.1 — 9 in our model), one must include a run- 
ning in the fit to ensure unbiased estimates of the power spectrum 
amplitude and slope. In these redshift bins, there appears to be 
a constraint on the running, ct2, but it is biased, as one can see 
by comparing the running estimates for N p = 2 (blue points) and 
N p = 3 (green points). Hence, although we include 012 in the fit 
for these redshift bins, this only ensures unbiased constraints on 
the slope and amplitude. 

1 — a when an additional parameter is included in the fit. 
For example, our estimate of the power spectrum ampli- 
tude near l+z ~ 8 changes by more than one sigma when 
we move from N p — 1 to N p — 2, and we consider it bi- 
ased. Likewise, estimates of the slope of the 21 cm power 
spectrum are biased in several redshift bins for N p = 1. 
For the slope and the amplitude, estimates converge in 
all redshift bins by N p = 2. In each redshift bin we 
include the minimum number of parameters required to 
provide an unbiased estimate of the amplitude and slope, 
so we include only parameters up to N p = 1 in some red- 
shift bins, while we include parameters up to N p = 2 
in others. In some of our most tightly constrained red- 
shift bins there initially appears to be a constraint on ol^. 
Including an additional parameter in the fit, 03, shows 
that our constraint on a2 is biased for N p — 2. Hence, 
while we include «2 as a parameter in some redshift bins, 
we only use our constraints on the amplitude and slope, 
since only these parameters are unbiased. 
Provided the MWA packs enough antennae into its 
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core, it does have the sensitivity to detect redshift evo- 
lution in the slope and amplitude of the 21 cm power 
spectrum, if not higher terms in the polynomial fit of 
Equation (6). While our forecasts are inevitably ideal- 
ized, the constraints are tight enough that we expect a 
reasonable detection of redshift evolution for our fiducial 
model in the super-core configuration. 

Of course, reionization may be more extended and/or 
occur at higher redshift than in our fiducial model. The 
calculations of §3 give some insight into the impact of 
the timing of reionization on MWA detection sensitivity. 
For example, Figure 4 implies that we still expect a sig- 
nificant detection of 21 cm fluctuations if reionization is 
already 50% complete at the highest redshift probed by 
our hypothetical MWA survey. In this scenario, however, 
the MWA would observe only the falling half of our an- 
ticipated trend of 21 cm power spectrum amplitude with 
redshift, and miss the earlier phase in which we expect 
the 21 cm power spectrum amplitude to grow with de- 
creasing redshift. Moreover, reionization may be more 
extended than in our fiducial model and lead to a more 
gradual power spectrum maximum when fluctuations are 
measured as a function of redshift. 



5. THE FILLING FACTOR OF HII REGIONS FROM POWER 
SPECTRUM MEASUREMENTS 

Let us turn our error estimates on the 21 cm power 
spectrum amplitude and slope into constraints on the 
filling factor of HII regions during reionization. As an 
illustration, we focus on power spectrum measurements 
in the specific redshift bin where the amplitude is max- 
imal - i.e., our input 'true' model is our fiducial model 
with (xi) = 0.5. We compare our simulated signal in 
this redshift bin, including MWA statistical error esti- 
mates, with fiducial model predictions at different ion- 
ization fractions. We compute Ax 2 between the differ- 
ent models, and calculate the likelihood that the data are 
drawn from a fiducial model with a given ionization frac- 
tion, assuming Gaussian statistics, L — exp(— Ax 2 /2). 
We assume in these calculations that our model 21 cm 
power spectra are entirely fixed by (a;,) and ignore the 
weak redshift dependence expected. Since our estimates 
of the amplitude and slope are correlated, we use the full 
co- variance matrix to compute A% 2 . 

We assume our fiducial model to calculate power spec- 
tra at different ionization fractions, ignoring model un- 
certainties in 21 cm power spectra at a given ionization 
fraction. This is justified - at least for the model of 
ionization fraction evolution we assume here - since the 
MWA can rule out the rare-source and mini-halo models 
by comparing measurements over the full redshift range. 
Specifically, we translate the power spectrum amplitude 
and slope measurements of Figure 7 from measurements 
as a function of redshift to measurements as a function of 
ionization fraction, assuming our fiducial model to con- 
vert between redshift and ionization fraction. We then 
compare our mock 21 cm power spectrum amplitude and 
slope measurements with the rare source and mini-halo 
models, computing A% 2 between these models and our 
fiducial model. We find that the MWA can distinguish 
between these models at very high significance - both 
the mini-halo and rare source model are ruled out at 
^> 3 — a confidence even with the r~ 2 antenna configura- 
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Fig. 9. — Ability of the MWA to constrain the ionization frac- 
tion in our fiducial model. The curves show calculations of the 
likelihood that an MWA measurement of the amplitude and slope 
of the 21 cm power spectrum, at a redshift where the power spec- 
trum amplitude is maximal, is drawn from a model with a given 
(volume- weighted) ionization fraction. The red solid curve assumes 
an r~ 2 antenna distribution and uses measurements of both the 
power spectrum amplitude and slope. The narrower blue solid 
curve shows the anticipated constraint for the MWA in the super- 
core configuration, using information from both the slope and the 
amplitude of the 21 cm power spectrum. The blue dashed curve 
is similar but shows the likelihood function obtained from the am- 
plitude alone. The black horizontal dotted line shows the 2 — a 
likelihood level. 

tion. 9 This means that the MWA can break degeneracies 
between the different models, and we can get approxi- 
mate error estimates on the ionization fraction from our 
fiducial model alone. 

The results of our ionization fraction likelihood calcu- 
lations are shown in Figure 9. The MWA, observing in 
the r~ 2 configuration, will provide a modest constraint 
on the ionization fraction at maximal amplitude: ion- 
ization fractions between (xj) = 0.4 — 0.75 are allowed 
at 2 — (7 confidence. The constraint is only modest be- 
cause the slope and amplitude measurements have rather 
large error bars in this configuration (Figure 9), and be- 
cause the amplitude and slope are fairly flat functions of 
ionization fraction around the peak (Figure 2). The con- 
straint is tighter at low ionization fraction than one might 
naively guess from eye-balling the results of Figure 2, 
however, since the slope and amplitude errors are corre- 
lated. In particular, data drawn from our low ionization 
fraction models will tend to have a smaller amplitude, 

9 In detail, we should marginalize over the uncertain (xi)(z) 
relation - which is what we ultimately aim to constrain - when 
constraining the rare source and mini-halo models. For simplicity, 
we assume the fiducial (xi)(z) relation in comparing with the rare 
source and mini-halo models, and find that these models are very 
strongly ruled out. From Figure 2, we believe these constraints are 
strong enough that they would not be alleviated by marginalizing 
over (xi)(z). 
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yet a steeper slope, than data drawn from our (xi) = 0.5 
model. Properly accounting for correlations between our 
slope and amplitude estimates then increases the con- 
straint on low ionization fraction models, compared to 
ignoring these correlations. 

On the other hand, the MWA observing in the super- 
core configuration provides a bit tighter constraint. Only 
(a;,) = 0.45 — 0.65 is allowed at 2 — a confidence. Com- 
paring the blue solid and dashed curves in the figure 
illustrates that most of the constraint comes from the 
amplitude measurement and not the slope measurement. 
Provided the middle phase of reionization occurs in the 
MWA observing band, a couple of years of observations 
should constrain (a;,) around (xi) — 0.5 to within roughly 
±5(xi) ^ 0.1 at 2 — o confidence. 

6. CONCLUSIONS 

In this paper, we considered the sensitivity of the MWA 
for 21 cm power spectrum measurements and examined 
the resulting insights into reionization. Rather gener- 
ically, the 21 cm power spectrum on MWA scales in- 
creases in amplitude with decreasing redshift, as HII re- 
gions grow and boost the level of large-scale fluctuations 
in the 21 cm signal, until the time at which ~ 50% of the 
volume of the IGM is ionized. Subsequently, later in the 
EoR, the amplitude of the 21 cm power spectrum drops 
as neutral hydrogen becomes scarce. In conjunction, the 
slope of the 21 cm power spectrum flattens as HII regions 
grow and the 21 cm power spectrum transitions, on the 
scales of interest, from simply tracing density fluctua- 
tions to tracing fluctuations in the ionization field. 

These generic features imply that, although MWA 
measurements will be limited to a dynamic range of ~ 
a decade in scale, the experiment can significantly con- 
strain reionization by comparing measurements in sev- 
eral redshift bins. In particular, the MWA may detect 
the power spectrum amplitude rise with decreasing red- 
shift, before subsequently turning-over and dropping in 
amplitude, with the slope of the power spectrum flat- 
tening in conjunction. This behavior is a signature of 
the IGM passing through a redshift where ~ 50% of its 
volume is filled with ionized bubbles. 

It is unlikely that residual foreground contamination 
would share the characteristic redshift evolution in power 
spectrum amplitude and slope found in our models. Mea- 
suring this characteristic redshift evolution will hence 
help confirm that detected 21 cm fluctuations originate 
from the high redshift IGM, and cement the case for neu- 
tral material in the high redshift IGM. 

We argued that the MWA is sensitive enough to detect 
this characteristic redshift evolution in the 21 cm power 
spectrum amplitude and slope, especially if it adopts a 
very compact configuration for its antenna tiles, and that 
reionization occurs at sufficiently moderate redshifts. A 
compact configuration optimizes power spectrum sensi- 
tivity because proposed configurations, with antennae 
distributed out to large ~ 1 km distances, populate long 
baselines too sparsely to be useful. Some antenna tiles 
are needed at long baselines for point source detection 
and antenna calibration, but we estimate that these re- 
quirements are not severe. 

Another possible advantage of the super-core config- 
uration relates to its well-behaved beam. A potential 
difficulty for 21 cm observations is that point sources 



far from the primary beam of an interferometer can en- 
ter through the beam's sidelobes, which are frequency 
dependent. This contamination will have structure in 
frequency space, and escape the usual foreground re- 
moval strategies (e.g. Oh & Mack 2003, Furlanetto et 
al. 2006a). This contamination will be significantly re- 
duced by adopting a very compact antenna distribution, 
like the super-core configuration, which will have a very 
clean beam with minimal sidelobes. 

Adopting a sufficiently compact antenna distribution, 
we believe that the MWA can move past a mere detec- 
tion of 21 cm fluctuations, and constrain the volume fill- 
ing factor of HII regions within two years of observing. 
In particular, measurements of the 21 cm power spec- 
trum amplitude and slope at a redshift where 50% of the 
volume is ionized translate into 2 — a allowed ionization 
fractions of roughly (xi) = 0.45 — 0.65. Moreover, we find 
that the MWA can rather easily distinguish between our 
fiducial model and each of our rare source and mini-halo 
models with 2,000 hours of data, at least if the bulk of 
reionization occurs in the range of redshifts probed by 
the MWA. 

In this paper, we focused on the 21 cm power spec- 
trum, but investigating other statistical measures would 
be interesting. One of the primary goals of reionization 
studies is to use observational measures to constrain the 
sizes and filling factor of HII regions during reionization. 
Clearly the ionization field, and the 21 cm signal, are 
highly non-Gaussian in the EoR, implying that there is, 
in principle, more information than contained in the 21 
cm power spectrum alone. We have argued that the red- 
shift evolution of the 21 cm power spectrum contains 
relatively robust information regarding the filling factor 
of HII regions during reionization, but these constraints 
may not be optimal and are rather indirect. On the other 
hand, it is unclear how beneficial higher order statistics 
will be in the low S/N regime relevant for the MWA and 
other first generation surveys. We plan further investiga- 
tion regarding the utility of various non-Gaussian statis- 
tical measures, considered as a function of instrumental 
sensitivity. Particularly interesting is that some low- A; 
bins become sample-variance dominated in the super- 
core configuration, implying that the MWA can actually 
image large scale modes in this configuration. 

Another useful endeavor would be to perform a mock 
MWA simulation, incorporating thermal noise, fore- 
ground models, and the MWA instrumental response and 
observing strategy. Our forecasts are inevitably simpli- 
fied in ignoring these details, and likely optimistic in this 
regard. Mock simulations should quantify how much 
the sensitivity of the MWA is degraded when incorpo- 
rating such real world details. Moreover, it would be 
interesting to examine how possible residuals from in- 
complete foreground cleaning might bias our constraints. 
Since the power spectrum variance increases strongly 
with wavenumber, the total S/N at which the MWA can 
detect the 21 cm power spectrum depends strongly on 
the precise foreground cut. This is an important issue 
for further investigation. 

Finally, in this paper we focused on a single parameter, 
the filling factor of ionized regions at different redshifts, 
and gave simple arguments for how one can constrain this 
quantity with the redshift evolution of the 21 cm power 
spectrum alone. A complementary approach for forecast- 
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ing MWA parameter constraints would be to perform a 
multi-parameter Fisher matrix analysis. 

We anticipate that the MWA will swiftly move beyond 
a mere detection of 21 cm emission from the high redshift 
IGM, and obtain valuable insights regarding the filling 
factor of HII regions at different stages of reionization. 
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